Farsi language prosodic structure, research and implementation using a speech synthesizer

Authors

  • Hamid Sheikhzadeh
  • A. Eshkevari
  • M. Khayatian
  • Mohammad Reza Sadigh
  • Seyed Mohammad Ahadi
Abstract

In this research, we investigated the prosodic features of the Farsi (Persian) language and quantified the major stress rules and some intonation rules for the purpose of speech synthesis. The research concentrates mostly on pitch variations and, secondarily, on durational changes. We implemented the proposed simplified prosodic rules using a Klatt formant synthesizer specially modified for Farsi phonemes. To achieve better speech quality, we exploited different allophonic forms for some consonants, leading to a total of 207 Farsi diphones synthesized by the speech synthesizer. Subjective listening tests show that the addition of the prosodic features drastically increases both the intelligibility and the naturalness of the synthesized speech. The synthesizer is implemented in software on a Pentium PC and operates in real time.

Similar resources

Implementation of a text-to-speech system for Farsi language

In this research, a text-to-speech system for the Farsi language has been implemented. The proposed synthesizer concatenates Farsi syllables using TD-PSOLA. This paper concentrates mainly on investigating pitch variations in Farsi sentences and presenting some novel rules for modeling these variations. Based on the location of the stressed syllable, we obtain a primary pitch curve f...

Prosodic elements to improve pronunciation in English language learners: A short report

The usefulness of teaching pronunciation in language instruction remains controversial. Though past research suggests that teachers can make little or no difference in improving their students’ pronunciation, current findings suggest that second language pronunciation can improve to be near native-like with the implementation of certain criteria such as the utilization of...

The Prosody of Discourse Structure and Content in the Production of Persian EFL Learners

The present research addressed the prosodic realization of global and local text structure and content in the spoken discourse data produced by Persian EFL learners. Two newspaper articles were analyzed using Rhetorical Structure Theory. Based on these analyses, the global structure in terms of hierarchical level, the local structure in terms of the relative importance of text segments and the ...

Statistical evaluation of the influence of stress on pitch frequency and phoneme durations in Farsi language

Stress is known to be an important prosodic feature of speech. The recognition of stressed speech has always been an important issue for speech researchers. On the other hand, providing a large corpus with the coverage of all different stressed conditions in a certain language is a difficult task. Farsi (Persian) has been no exception to this. In this research, our aim has been to evaluate the ...

Prosodic vs. segmental contributions to naturalness in a diphone synthesizer

The relative contributions of segmental versus prosodic factors to the perceived naturalness of synthetic speech were measured by transplanting prosody between natural speech and the output of a diphone synthesizer. A small corpus was created containing matched sentence pairs wherein one member of the pair was a natural utterance and the other was a synthetic utterance generated with diphone dat...

Publication date: 1999